BIOTECHNOLOGY "P?;?^ 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: M flob^. <' 
Date Processed by STIC: \\ [ft %lb 1 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOlR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mail help: natin21hclD@usDto.gov or phone 703-306-41 19 (R, Wax) 
PATENTIN 3,0 e-mail help: natin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW; 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST. 25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://www.uspto.gov/vveb/offices/pac/checker 



BEST AVAltABU COPY 



Raw Sequence Lbtlnr Error Summary 



iliilol 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMDER 

ATTN: NEW RULES CASES: PLEASE DISRECARD ENCLIS1I "ALPHA" HEADERS. WHICH WERE INSERTED D V 

1 Wrapped Nucleic* The numberAcxt at Ihe end or each line "wra pp«T down to the next line. -Jhu may occur if your fi 

Wrapped Aminos was retrieved in a word processor after creating K Please adjuit your right ma/gin to J; this will 
prevent •'wrapping,** * 



! . Invalid Line Length The rulei require that a line not cicced 72 characters in length. *th!i includes white spaces. 



^Misaligned Amino 
Numbering 

Non-ASCII 



_ Variable Length. 



Pa tent I a 3.0 
"bug" 



_S kipped Sequences 
"(OLD RULES) 



The numbering under each 3* amino acid is misaligned Do not use tab codes between numbers; 
use apace characters, instead 

The submitted file was not saved in ASCII(DOS)tcxt, as rx^Hcxi by the Sequence Rules. Plcajc 

ensure your tubae^uent submliaton b aavtd In ASCII text. 

- i ■ * * i 

* " 1 v 

Sequencers) contain n*s or XaVs represent ing more than, one residue. Per Sequence Rules, 
each it or Xaa can only represent a single residue. Please present the majlmum number of each 
residue having variable length and indicate in the <220>*<223> section that some may be missing. 

A "bug** in Patentln version 2 .(This c^njcd uic <220>*<223>*cctjon to be missing from amino acid 

sequencers) . Normally, Patentln would'automaa'c^lfxg'eneratc this section from the 

previously coded nucleic acid sequence. Please manually copy lh« relevant <220>-<323> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequencers) missing. If intentional, please insert the following lines for caeh skipped sequent 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where -X" is 'shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xl) SEQUENCE DESCRJPTION:SEQ ID NO:X: (insert SEf/lD.NO where "X" is shown) 

This sequence is intentionally skjppcd 

Please also adjust the "(it) NUMDER OF SEQUENCES:*' response to Include the skipped sequence: 



Skipped Sequences Sequencers) 
"(NEW RULES) 



^ missing. If Intentional, plejsc insert the following lines for cacli skipped seque 



_Usc of n*i or Xaa'i 
(NEW RULES) 



10 Invalid <2I3> 

Response 




<2I0> seq ucncc id number 
<400> sequence id number 
000 

/ •■ 

Use of n*s and/or Xaa's have been defected in Ihe Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's arc present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa re pre sen 

Per 1.823 of Sequence Rules, the only valid <2I3> responses arc: Unknown, Artificial Sequence, or 
scicntific'rume (Gcnus/spocies). <220>-<223> section is required when <2I3> response is Unknow 
is Artificial Sequence 



12 



Use of <220> 



_Palcntln 2.0 
"bug" 



Sequencers) y y missing the <220> "Feature" and associated numeric identifiers and responses 
Use of <220> to <223> is MANDATORY if <2 13> "Organism" response is "Artificial Sequence* 4 or 
"Unknown." Please explain source of genetic material in <220> td <223> section. 
(Sec "Federal Register" 06/01/1998. Vol. 63. No. I0<, pp. 2963 1-32) (Sec. 1.823 of Sequence Rule 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
lilting). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 ■ • Misuse of n * n tan only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to rcprcs 

any value not specifically a nucleotide. 
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1600 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/286 , 166B 



DATE: 11/08/2001 
TIME: 09:06:19 



Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\11082001\I286166B.raw 



3 
4 
5 
6 
7 
8 
9 
11 
12 
14 
16 
17 
19 
20 
22 
23 
25 
26 
28 
29 
31 
33 
35 
36 
37 
38 
40 



<110> APPLICANT: Fowlkes, 
Broach, Jim 
Manfredi, John 
Klein, Christine 
Murphy, Andrew J. 
Paul, Jeremy 
Trueheart, Joshua 

<120> TITLE OF INVENTION: 
PROTEIN SURROGATES, 



Dana M. 



YEAST CELLS ENGINEERED TO 
AND USES THEREFOR 



Does Not Comply 
Corrected Diskette Needed 

PRODUCE PHEROMONE 



2ROMONE SYSTEM 



<130> 
<140> 
<141> 
<150> 
<151> 
<150> 
<151> 
<150> 
<151> 
<150> 
<151> 
<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<400> 



FILE REFERENCE: CPI - 012CP4BCN 
CURRENT APPLICATION NUMBER: 09/286, 166B 
CURRENT FILING DATE: 1999-04-05 
PRIOR APPLICATION NUMBER: 08/322,137 

1994-10-13 
NUMBER: 08/309, 313 

1994-09-20 
NUMBER: 08/190, 328 

1994-01-31 
NUMBER: 08/041,431 
1993-03-31 
NOS: 133 



PRIOR 
PRIOR 
PRIOR 
PRIOR 
PRIOR 



FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
NUMBER OF SEQ ID 



2.0 



SOFTWARE: Patentln Ver. 
SEQ ID NO: 1 
LENGTH: 89 
TYPE: PRT 

ORGANISM: Saccharomyces cerevisiae 
SEQUENCE: 1 




41 


Met 


Arg 


Phe 


Pro 


Ser 


He 


Phe 


Thr 


Ala 


Val 


Leu 


Phe 


Ala 


Ala Ser 


Ser 


42 


1 








5 










10 








15 




44 


Ala 


Leu 


Ala 


Ala 


Pro 


Val 


Asn 


Thr 


Thr 


Thr 


Glu 


Asp 


Glu 


Thr Ala 


Gin 


45 








20 










25 










30 




47 


He 


Pro 


Ala 


Glu 


Ala 


Val 


He 


Gly 


Tyr 


Leu 


Asp 


Leu 


Glu 


Gly Asp 


Phe 


48 






35 










40 










45 






50 


Asp 


Val 


Ala 


Val 


Leu 


Pro 


Phe 


Ser 


Asn 


Ser 


Thr 


Asn 


Asn 


Gly Leu 


Leu 


51 




50 










55 










60 








53 


Phe 


He 


Asn 


Thr 


Thr 


He 


Ala 


Ser 


He 


Ala 


Ala 


Lys 


Glu 


Glu Gly Val 


54 


65 










70 










75 








80 


56 


Ser 


Leu 


Asp 


Lys 


Arg 


Glu 


Ala 


Glu 


Ala 














57 










85 























60 <210> SEQ ID NO: 2 

61 <211> LENGTH: 76 

62 <212> TYPE: PRT 

63 <213> ORGANISM: Saccharomyces cerevisiae 

65 <400> SEQUENCE: 2 

66 Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 

67 1 5 10 

69 Ala Glu Ala Glu Ala Trp His Trp Leu Gin Leu Lys Pro 



Lys Arg Glu 
15 

Gly Gin Pro 



file://C:\CRF3\Outhold\VsrI286166B.htm 



11/8/01 
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RAW SEQUENCE LISTING DATE: 11/08/2001 

PATENT APPLICATION: US/09/286, 166B TIME: 09:06:1*9 

Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\11082001\I286166B.raw 

70 20 25 30 

72 Met Tyr Lys Arg Glu Ala Asp Ala Glu Ala Trp His Trp Leu Gin Leu 

73 35 40 45 

75 Lys Pro Gly Gin Pro Met Tyr Lys Arg Glu Ala Asp Ala Glu Ala Trp 

76 50 55 60 

78 His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 

79 65 70 75 

82 <210> SEQ ID NO: 3 

83 <211> LENGTH: 15 

84 <212> TYPE: DNA 

85 <213> ORGANISM: Artificial Sequence 

87 <220> FEATURE: .j 

88 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic ^ V 

89 sequence 

91 <400> SEQUENCE: 3 

92 aagcttaaaa gaatg 15 

95 <210> SEQ ID NO: 4 

96 <211> LENGTH: 37 

97 <212> TYPE: DNA 

98 <213> ORGANISM: Artificial Sequence 

100 <220> FEATURE: 

101 <221> NAME/KEY: CDS 

102 <222> LOCATION: (1)..(24) 

104 <220> FEATURE: A 

105 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic * 

106 sequence 

108 <400> SEQUENCE: 4 

109 aaa gaa gaa ggg gta tct ttg ctt aagctcgaga tct 37 

110 Lys Glu Glu Gly Val Ser Leu Leu 

111 1 5 

114 <210> SEQ ID NO: 5 

115 <211> LENGTH: 8 

116 <212> TYPE: PRT 

117 <213> ORGANISM: Artificial Sequence 

119 <220> FEATURE: 

120 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic * 

121 sequence 

123 <400> SEQUENCE: 5 

124 Lys Glu Glu Gly Val Ser Leu Leu 

125 1 5 

128 <210> SEQ ID NO: 6 

129 <211> LENGTH: 77 

130 <212> TYPE: DNA 

131 <213> ORGANISM: Artificial Sequence 

133 <220> FEATURE: 

134 <221> NAME/KEY: misc_feature 

135 <222> LOCATION: (29).. (66) j/ 

136 <223> OTHER INFORMATION: Any occurrences of n may be any nucleotide ^ 
138 <220> FEATURE: 
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11/8/01 



RAW SEQUENCE LISTING DATE: 11/08/2001 

PATENT APPLICATION: US/09/28 6, 166B TIME: 09:06:19 



Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\11082001\I286166B.raw 

139 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

140 sequence 
142 <400> SEQUENCE: 6 

W- - > 14 3 cgtgaagctt aagcgtgagg cagaagcynn knnknnknnk nnknnknnkn nknnknnknn 60 
W--> 14 5 knnknnktga tcatccg C 77 

148 <210>(SEQ ID NO: 7 

149 <211> LENGTH: 19 

150 <212> TYPE: PRT 

151 <213> ORGANISM: Artificial Sequence 

153 <220> FEATURE: 

154 <221> NAME/KEY: VARIANT 

155 <222> LOCATION: (7).. (19) 

156 <223> OTHER INFORMATION: Any occurrences of Xaa may be any amino acid 

158 <220> FEATURE: 

159 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

160 sequence 

162 <400> SEQUENCE: 7 ^ 
W--> 163 Lys Arg Glu Ala Glu Ala (xaa) Xaa^XaA Xaa Xaa Xaa Xaa Icaa XaCa^Xaa 

164 1 ^ /^-^ 5 10 15 

W--> 166 Xaa^Xa^Ca^) 

170 <210> SEQ-fD NO: 8 

171 <211> LENGTH: 36 

172 <212> TYPE: PRT 

173 <213> ORGANISM: Saccharomyces cerevisiae 

175 <400> SEQUENCE: 8 

176 Met Gin Pro Ser Thr Ala Thr Ala Ala Pro Lys Glu Lys Thr Ser Ser 

177 15 10 15 

179 Glu Lys Lys Asp Asn Tyr lie lie Lys Gly Val Phe Trp Asp Pro Ala 

180 20 25 30 

182 Cys Val He Ala 

183 35 

186 <210> SEQ ID NO: 9 

187 <211> LENGTH: 19 

188 <212> TYPE: DNA 

189 <213> ORGANISM: Artificial Sequence 

191 <220> FEATURE: 

192 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic- 

193 sequence 

195 <4 00> SEQUENCE: 9 

196 aagctttcga atagaaatg 19 

199 <210> SEQ ID NO: 10 

200 <211> LENGTH: 36 

201 <212> TYPE: DNA 

202 <213> ORGANISM: Artificial Sequence 

204 <220> FEATURE: 

205 <221> NAME/KEY: CDS 

206 <222> LOCATION: (1)..(27) 

208 <220> FEATURE: 

209 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 



file://C:\CRF3\Outhold\VsrI286166B.htm 



RAW SEQUENCE LISTING DATE: 11/08/2001 

PATENT APPLICATION: US/09/286 , 166B TIME: 09:06:19 



Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\11082001\I286166B.raw 

210 sequence 

212 <400> SEQUENCE: 10 

213 gcc get cca aaa gaa aag acc teg age tcgettaag 36 

214 Ala Ala Pro Lys Glu Lys Thr Ser Ser 

215 1 5 

218 <210> SEQ ID NO: 11 

219 <211> LENGTH: 9 

220 <212> TYPE: PRT 

221 <213> ORGANISM: Artificial Sequence 

223 <220> FEATURE: 

224 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

225 sequence 

227 <400> SEQUENCE: 11 

228 Ala Ala Pro Lys Glu Lys Thr Ser Ser 

229 1 5 

232 <210> SEQ ID NO: 12 

233 <211> LENGTH: 79 

234 <212> TYPE: DNA 

235 <213> ORGANISM: Artificial Sequence 

237 <220> FEATURE: 

238 <221> NAME/KEY: misc_f eature 

239 <222> LOCATION: (27).. (58) 

240 <223> OTHER INFORMATION: Any occurrences of n may be any nucleotide 

242 <220> FEATURE: 

243 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

244 sequence 

246 <400> SEQUENCE: 12 f v 

W-- > 247 ggtactcgag tgaaaagaag gacaaannkn nknnknnknn knnknnknnk nnknnknnkt 60 

249 gtgttattgc ttaagtacg \ / 79 

252 <210> SEQ ID NO: 13 

253 <211> LENGTH: 22 

254 <212> TYPE: PRT 

255 <213> ORGANISM: Artificial Sequence 

257 <220> FEATURE: 

258 <221> NAME/KEY: VARIANT 

259 <222> LOCATION: (8).. (18) ^Y. 

260 <223> OTHER INFORMATION: Any occurrences of Xaa may be any amino acid 

262 <220> FEATURE: 

263 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

264 sequence 
266 <400> SEQUENCE: 13 

W--> 267 Ser Ser Glu Lys Lys Asp Asn/Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
268 1 "\ 5 / 10 15 

W--> 270 Xaa Xai Cys Val lie Ala I < 5r V-- 
271 I 20 

274 <210> EEQ ID NO: 14 

275 <211> 'LENGTH: 34 

276 <212> TYPE: DNA 

277 <213> ORGANISM: Artificial Sequence 
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RAW SEQUENCE LISTING DATE: 11/08/2001 

PATENT APPLICATION: US/09/286 , 166B TIME: 09:06:19 



Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\11082001\l286166B.raw 

279 <220> FEATURE: 

280 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

281 * sequence 

283 <400> SEQUENCE : 14 

284 gttaagaacc atatactagt atcaaaaatg tctg 34 

287 <210> SEQ ID NO: 15 

288 <211> LENGTH: 35 - 

289 <212> TYPE: DNA 

290 <213> ORGANISM: Artificial Sequence Ci ~^ 

292 <220> FEATURE: 

293 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

294 sequence 

296 <400> SEQUENCE: 15 

297 tgatcaaaat ttactagttt gaaaaagtaa tttcg 35 

300 <210> SEQ ID NO: 16 

301 <211> LENGTH: 28 

302 <212> TYPE: DNA 

303 <213> ORGANISM: Artificial Sequence 

305 <220> FEATURE: 

306 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

307 sequence 

309 <400> SEQUENCE: 16 

310 ggcaaaatac tagtaaaatt ttcatgtc 28 

313 <210> SEQ ID NO: 17 

314 <211> LENGTH: 34 

315 <212> TYPE: DNA 

316 <213> ORGANISM: Artificial Sequence 

318 <220> FEATURE: 

319 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

320 sequence 

322 <400> SEQUENCE: 17 

323 ggcccttaac acactagtgt cgcattatat ttac 34 

326 <210> SEQ ID NO: 18 

327 <211> LENGTH: 60 

328 <212> TYPE: DNA 

329 <213> ORGANISM: Artificial Sequence 

331 <220> FEATURE: 

332 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

333 sequence 

335 <400> SEQUENCE: 18 

336 ctaaagaaga aggggtatct ttgcttaagc tcgagatctc gactgataac aacagtgtag 60 

339 <210> SEQ ID NO: 19 

340 <211> LENGTH: 31 

341 <212> TYPE.: DNA 

342 <213> ORGANISM: Artificial Sequence 

344 <220> FEATURE: 

345 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
34 6 sequence 

348 <400> SEQUENCE: 19 
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<210>53 
<211>11 

<212>PRT^ 

<21 ^Artificial Sequent 



<400>53 



^AhU.Fk.ttlHbfk.Fk.AvD.F* 




Page 6 of 7 



VERIFICATION SUMMARY DATE: 11/08/2001 

PATENT APPLICATION: US/09/28 6 , 166B TIME: 09:06:20 

Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\11082001\I286166B.raw 



L:143 
L:145 
L:163 
L:166 
L:247 
L:267 
L:270 
L:458 
L:489 
L:623 
L:806 
L:806 
L:182 
L:184 



M:341 W 
M:341 W 



M: 
M: 
M: 
M: 
M: 



341 W 
341 W 
341 W 
341 W 
341 W 
M:341 W 
M:341 W 
M:341 W 
M:258 W 
M:258 W 
2 M:341 W 
9 M:341 W 



(46) "n" or "Xaa" 
(46) "n" or "Xaa" 
(46) "n" or "Xaa" 
(46) "n" or "Xaa" 
(46) "n" or "Xaa" 
(46) "n" or "Xaa" 
(46) 
(46) 
(46) 
(46) 



or "Xaa" 
or "Xaa" 
or "Xaa" 
or "Xaa" 
Mandatory Feature 
Mandatory Feature 
(46) "n" or "Xaa 
(46) "n" or "Xaa 



used, for SEQ ID# : 6 

used, for SEQ ID# : 6 

used, for SEQ ID# : 7 

used, for SEQ ID# : 7 

used, for SEQ ID#:12 

used, for SEQ ID#:13 

used, for SEQ ID#:13, 

used, for SEQ ID#:27 

used, for SEQ ID#:29 

used, for SEQ ID#:39 . 

missing, <220> FEATURE: 

missing, <223> OTHER INFORMATION 
" used, for SEQ ID#:122 
n used, for SEQ ID#:124 
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